Twitter User Classification using Ambient Metadata
Abstract
Microblogging websites, especially Twitter, have become an important means of communication. These services have often been found to be faster than conventional news services. With millions of users on the platform, there is a need to classify users based on the ambient metadata associated with their accounts. We look in particular at how effective the ‘profile description’ field is for user classification. Our results show that such metadata can be an effective feature for classification tasks.
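The abstract does not say which classifier or features the authors used, so the following is only a minimal sketch of the general idea: treating the profile description as a bag of words and fitting a small multinomial naive Bayes model on it. The function names, the class labels (`news`, `person`) and the toy descriptions are all hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase a profile description and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def train(examples):
    """Fit a multinomial naive Bayes model on (description, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return label_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log-posterior for a description."""
    label_counts, word_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in label_counts.items():
        score = math.log(count / total)  # log prior
        # Add-one (Laplace) smoothing over the training vocabulary.
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokenize(text):
            if tok in vocab:  # ignore words never seen in training
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy data: labels and descriptions are invented for illustration.
TOY_PROFILES = [
    ("Breaking news and headlines from our newsroom", "news"),
    ("Official account for daily news updates", "news"),
    ("Coffee lover, dad, weekend cyclist", "person"),
    ("Student. I love music and football", "person"),
]

model = train(TOY_PROFILES)
print(predict(model, "Latest news headlines"))
print(predict(model, "I love coffee and cycling"))
```

A real experiment would need a labeled collection of profiles and a held-out evaluation split; the four toy rows here only show the data flow from description text to predicted label.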
Related papers
A High-Performance Model based on Ensembles for Twitter Sentiment Classification
Background and Objectives: Twitter sentiment classification is one of the most popular fields in information retrieval and text mining. Millions of people around the world use social networks like Twitter intensively. Twitter lets users publish tweets to say what they think about topics. There are numerous websites built on the Internet presenting Twitter. The user can enter a sentiment ta...
Classification of Twitter Users Who Tweet About E-Cigarettes
BACKGROUND Despite concerns about their health risks, e‑cigarettes have gained popularity in recent years. Concurrent with the recent increase in e‑cigarette use, social media sites such as Twitter have become a common platform for sharing information about e-cigarettes and to promote marketing of e‑cigarettes. Monitoring the trends in e‑cigarette-related social media activity requires timely a...
A Stacking-based Approach to Twitter User Geolocation Prediction
We implement a city-level geolocation prediction system for Twitter users. The system infers a user’s location based on both tweet text and user-declared metadata using a stacking approach. We demonstrate that the stacking method substantially outperforms benchmark methods, achieving 49% accuracy on a benchmark dataset. We further evaluate our method on a recent crawl of Twitter data to investi...
On-the-fly Detection of Autogenerated Tweets
Most previous work related to tweet classification has focused on identifying a given tweet as spam, or on classifying a Twitter user account as a spammer or a bot. In most cases the tweet classification has taken place offline, on a pre-collected dataset of tweets. In this paper we present an on-the-fly approach to classify each newly downloaded tweet as autogenerated or not. We define an auto...
Predicting age groups of Twitter users based on language and metadata features
Health organizations are increasingly using social media, such as Twitter, to disseminate health messages to target audiences. Determining the extent to which the target audience (e.g., age groups) was reached is critical to evaluating the impact of social media education campaigns. The main objective of this study was to examine the separate and joint predictive validity of linguistic and meta...
Journal title: CoRR
Volume: abs/1407.8499
Publication date: 2014